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This paper unifies the concepts of evolutionary games and quantum strategies. First, we state the formulation 
and properties of classical evolutionary strategies, with focus on the destinations of evolution in 2-player 2- 
strategy games. We then introduce a new formalism of quantum evolutionary dynamics, and give an example 
where an evolving quantum strategy gives reward if played against its classical counterpart. 
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Game theory ifTTI is a theory of decision making, which 
provides mathematical tools for investigating situations in 
which several parties make decisions according to their own 
interests. In a game, each player aims at maximizing their 
utilities or payoffs by choosing the best choice of strategies 
they can offer. The last few decades witnessed several new 
concepts of game theory, such as quantum games and evo- 
lutionary games, attracting much attention among not only 
game theorists, but also mathematicians, quantum physicists, 
economists, biologists and computer scientists. 

Quantum games proposed a new perspective for the solu- 
tion of game theory. In a classical game, players draw a 
mixed strategy from a probability distribution p. To modify 
ones strategy, each player i is allowed to apply linear trans- 
formation among her own part of probability vector p im In 
quantum games, however, players employ quantum strategies 
instead of mixed strategies, over a larger set of quantum states 
p in Hilbert space H. It has been shown that quantum strate- 
gies lead to advantages over classical strategies in some par- 
ticular game examples. For instances, Meyer [6] quantized a 
coin tossing game and discovered that one may use quantum 
strategy against his opponent's classical strategy to win a high 
utility with certainty. Eisert et. al. |2 ] proposed a new model 
for 2-player 2-strategy games with entanglement by quantiz- 
ing prisoner's dilemma. Zhang fT3l modified their model and 
allowing arbitrary admissible local operations to each player, 
and addressed a quantitative study on the increase of equi- 
librium payoffs in quantum games. All previous results have 
been restricted to static games, without considering any form 
of dynamical strategy evolution. 

Evolutionary game theory (8j|9l[T2]|, innovated by the study 
of biological evolution, does not rely on the assumption of 
completely rational players but on the idea that Darwinian 
process of natural selection [3 ] drives biological systems to- 
wards optimization of future reproductive success |5|. John 
Maynard Smith (HJ was the first geneticist who tried to model 
the dynamics of biological systems in terms of game theory. 
He introduced a new concept called evolutionary stable strat- 
egy to explain the distribution of different phenotypes within 
biological populations |9|. Generally, given the set of differ- 
ential equations of strategy dynamics, the equilibrium states 
can be determined as the fixed points of the dynamical sys- 
tem. One particular model describing how populations evolve 
is called replicator dynamics, whose fixed points are found to 
be Nash equilibria Q. 



To begin with, we briefly review the notion of classical evo- 
lutionary game by the following game example. Assume two 
populations, Assyrian and Babylonian, are going to decide 
whether to trade (T) or to farm (F) for their livings. Gener- 
ally trading makes good profit, but both populations need to 
cooperate (both choose T) in order to make successful trad- 
ing, otherwise the one who performs trading earns nothing. 
The payoffs to players according to their strategies pair are 
determined by payoff matrices A and B (expression [TJ. No- 
tice that the game is a symmetric game so payoff matrix B is 
the transpose of A. 
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Now Assyrian and Babylonian play the game repeatedly. In 
each stage, they have to decide how to distribute their people 
into a mixed proportion of trading and farming. We called 
their distributions of people as two mixed strategies x and y, 
each is represented by a 2-dimensional probability vector over 
strategies T and F. The game is equivalent to a finite- strategy 
game that each player chooses strategy T or F according to 
probability distribution x and y. The expected payoff of each 
player is a bilinear function of x and y, given by u A = x T Ay 
and u B = x 1 By. 

A strategy pair (x,y) is represented by a point in a 2- 
dimensional affine subspace S in [0, l] 4 . Assuming that the 
time step between two stage games is infinitesimally small and 
the game is played infinitely many times, the mixed strategies 
x(t) and y(t) can be treated as continuous vector functions 
of time t. The product strategy is a time-dependent vector 
pit) = x(t) x yit) in the strategy space <S, which trajectory 
draws an evolution path of their mixed strategy distribution. 

In this Trading-Farming game, here are 3 dynamical equi- 
libria: both players choose T, both players choose F, or both 
choose T with probability 1/2. We called the first two states as 
pure equilibria (PE), and the third one as internal equilibrium 
(IE), which is an equilibrium probability vector with no zero 
entry. Since these 3 strategy pairs lead both players reach- 
ing local optimal payoff, they would not have the intention to 
modify their strategies once they reached one of the equilib- 
rium states. On the contrary, if they are not using equilibrium 
strategies in current stage, there exists an alternative strategy 
for each player to earn more. As to maximize future payoffs, a 
strategy evolves with time subject to payoff difference. There- 
fore, the strategy evolution can be captured by certain kind of 
evolutionary dynamics. 
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One of the most commonly mentioned evolutionary dynam- 
ics is called replicator dynamics, which was well-established 
by Bomze (1983) and applied to explain animal evolution be- 
haviors 1 1 ]. Formally, following replicator dynamics, the evo- 
lution of Assyrian's and Babylonian's strategy is determined 
by the following differential equations: 



Payoff contour (Trading- Farming) 



dt 
dyi = 
dt 



jXi((Ay)i -x T Ay) 
-- 1 y l ((x T B) l -x T By) 



Intuitively, if a pure strategy (say T) yields a better payoff 
compared to the current mixed strategy x, Assyrian will grad- 
ually expand the probability share of using T in proportion to 
the payoff difference, where 7 is the rate of evolution. 

Below are some general properties of classical 2-player 2- 
strategy evolutionary games. Generally, with payoff matrices 



and B 



A= a \ 
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strategies x = (x, 1 
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let the players employ mixed 



y), so Assyrian 



will choose T with probability x and Babylonian will choose 
T with probability y. For every strategy pair (x,y), we let 
v(x,y) be a velocity vector function subject to replicator dy- 
namics. For any 2-player 2-strategy game, there is at most 
1 internal equilibrium. An internal equilibrium is uniquely 
determined to be (x*,y*) = ( a ,_ffl^ +d , , a J b Z b c+d ) ^ it ex- 
ists. Based on its position, the probabilistic strategy space 
can be divided into 4 quadrants, with the internal equilibrium 
(x*,y*) locates at centre. We claim that for any two internal 
points within the same quadrant, the sign of each component 
of their velocity vectors should be the same. For any two in- 
ternal points and located in different quadrants, their velocity 
vectors have at least one component different in sign. These 
properties can be shown by the bilinearity of the expected pay- 
off function, where the payoff has to be monotonic increasing 
or decreasing if we fix the strategy of one player. Thus it is 
obvious that for any 2-player 2-strategy game without inter- 
nal equilibrium, an evolving strategy will always collapse to 
a state on boundary, which means at least one player ends up 
with a dominant pure strategy by the end of evolution. 

Next, we put forth our discussions to symmetric games. For 
any 2-player 2-strategy symmetric game, the payoff matrices 
can be expressed in 4 parameters as below: 
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In particular, our previous Trading-Farming game has param- 
eters a > c = d > b, and the famous Prisoner-Dilemma 
game is another example with b > d > c > a. A game 
with a > c and b > d (or a < c and b < d) has no mixed 
equilibrium and only one pure equilibrium, then the strategy 
evolution will always converge to that pure equilibrium. For 
symmetric games with single internal equilibrium, we classify 
them into two categories. One category (type I) includes sym- 
metric games like Hawk-Dove, which have parameters a < c 
and b > d. Another category (type II) includes symmetric 




FIG. 1 . A contour plot of strategy space of a Trading-Farming game. 
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FIG 2. Type I and Type II symmetric game. 

games like Trading-Farming, which have parameters a > c 
and b < d. 

In a type I symmetric game, assuming both players follow 
same evolutionary dynamics, if they start at same initial mixed 
strategy, their strategies evolve towards the internal equilib- 
rium. Otherwise, the player with larger initial share of strat- 
egy T ultimately converges to pure strategy T and the other 
player converges to pure strategy F. One can deduce the above 
properties by the fact that an evolving strategy never moves 
across the diagonal x — y'm the 2-dimensional strategy space, 
due to the symmetric property of evolutionary dynamics, i.e. 
v x (x, y) = v y (y, x). Therefore, any unequal strategy pair will 
finally converge to either TT or FF through evolution. 

The result for type II symmetric games is completely differ- 
ent. Let H and D be the choices of strategies, given an internal 
equilibrium (x*,y*), for any point in the region x > and 
y < y* 9 the evolution converges to HD. Within the region 
x < x* and y > y* , the evolution converges to DH. For the 
other two regions, it can be shown that if a strategy at (x, y) 
converges to HD, a point (x — a, y + j3) also converges to HD, 
for all < a < x , < f3 < 1 — y. Similarly, if a strategy at 
(x, y) converges to DH, a point (x+a, y—0) also converges to 
DH for all < a < 1 — x ,0 < (3 < y. Combining the results 
above, given any evolutionary symmetric game G under repli- 
cator dynamics, the strategy space can be divided into 2 open 
regions, where each pure equilibrium lies in each regions, and 
the internal equilibrium lies on the boundary. Then any initial 
strategy inside a region converges to pure equilibrium con- 
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tained in that region, and any strategy located on boundaries 
between regions converges to the internal equilibrium. 

Notice that replicator dynamics is not the only possible dy- 
namical system describing continuous strategy evolution. As 
soon as the evolutionary dynamics satisfies adjustment prop- 
erty ifTOl : jj^x T Ay > 0, the arguments in this letter follows. 

Now we consider quantum evolutionary games which allow 
players to use strategies quantum mechanically. In a classical 
game, each player has a strategy set <S, and the mixed strategy 
distribution is represented by probability vector p = x x y. 
The expected payoffs are given by u A (p) = . xiyjAij and 
u B (p) = J2ij x iUjBij. In a quantum game, each player i 
now has a Hilbert space Hi = span{si : S{ G Si}. A quan- 
tum strategy can be represented by a quantum state = 
Y^ij y/xiyj\ij). Here a quantum state is a generalization of 
classical probability distribution. Given a quantum state 
players receive their payoffs by performing a quantum mea- 
surement on computational basis S, where the payoff function 
of each player is defined to be u a (\i(j)) = Y^ij K^l^)! 2 ^ 
and u B (\ifj)) = Kul / 0)| 2 ^ij- m evolutionary games, a 
classical player can shuffle the probability distribution over 
her strategy space, a quantum player can perform operation 
on quantum state by applying admissible super-operators 
in her local space Hi . 

Since classical and quantum strategies locate in different 
strategy space, their evolutionary behaviours would be possi- 
bly different. Zhang fT3l shows that a quantum equilibrium 
strategy always induces a classical equilibrium distribu- 
tion through measurement. More precisely, if \ip) is a quan- 
tum Nash equilibrium, then p defined by p(s) = \(s\ifj}\ 2 is 
essentially a classical Nash equilibrium. Since Nash equilib- 
rium is a sufficient condition for a strategy to remain station- 
ary under replicator dynamics [4], the above theorem implies 
that a quantum equilibrium state |^*) always implies a classi- 
cal evolutionary fixed point at corresponding position p* given 

byp(s) = |<^#>| 2 . 

Now we use quantum replicator dynamics as an example to 
illustrate the evolution of non-equilibrium strategy. For clas- 
sical replicator dynamics, a player tends to increase the prob- 
ability share X{ of a particular strategy i only when the payoff 
of pure strategy Si is higher than the payoff of mixed strategy 
x. The rate of change of X{ is proportional to the net increase 
of payoff and its share following the expression below: 



^ — ^2jk x jVk(Aik Ajk) 

Xi(u A (si,y) - u A (x,y)) 



(3) 



Assume that Babylonian uses mixed strategy y, here we de- 
note the expected payoff of Assyrian using pure strategy Si 
to be u A (si, y) = ^ AikVk » an d its expected payoff using 
mixed strategy x to be u A (x, y) = J2jk Ajk%jVk- We name 



the function v) 



y) be the partial veloc- 



ity of evolutionary strategy s$. Then we denote the diagonal 



matrix with (z, i) 
Therefore, f t x(t) = V^x(t). 



,0*0 



to be the velocity operator. 




FIG. 3. Classical (left) and quantum (right) replicator dynamical 
evolution in Trading-Farming game. 

Consider the evolution of joint strategy p = x x y, since 
f t p = x X j- t y + f t x X y = (I (g) + V (x) <8> I)p = Vp, 
where V = I <g> + (g) I is called the joint velocity 
operator which is also a diagonal matrix. 

With analog to classical regime, the evolution of quantum 
strategy also depends on payoff differences between the target 
and current strategies. In this case, the quantum partial veloc- 
ity is given by v\ x>} = u A (\i)\i/j y )) — u A (\ij))), where 
\ip y ) = Tr x (\ip)). In quantum mechanics, the evolution of a 
quantum state is generated by a Hamiltonian H, and the rate 
of change of a state is described by Schrodinger equation: 



*4i 

dr 



(4) 



Here the quantum generating function of evolution is given 
by —iH/H. Following replicator dynamics, given a quantum 
strategy the Hamiltonian generator of Assyrian fol- 
lowing quantum replicator dynamics is defined as below. 

H (x \m = j2-i(u A (m v )) -^iv^)) w„>ii>oi 

Then the product Hamiltonian H = I g) + g) I 
drives quantum strategies of players to evolve, which itself 
is state-dependent and thus time-dependent during evolution. 
For any later time t > 0, the quantum strategy p(t) evolves 
based on the initial state |^(0)) and Hamiltonian generator 
H(\ip)) according to the following: 

\m) = u(o,t)\m) = e -/o*<»ci*(*')»<«'|^(o)) ( 5 ) 

Quantum replicator dynamics is only applicable on quan- 
tum strategy but not classical strategy. Different from its 
classical counterpart, a quantum strategy may evolve dras- 
tically even if its classical counterpart is a dominant pure 
equilibrium. For example, consider a 2-player symmetric 

fime G with payoff matrices A and B define as expression 
In classical regime, let x(t) = (cos 2 sin 2 9) T and 
y(t) = (cos 2 (j) sin 2 <j)) T be the mixed strategy of Assyr- 
ian and Babylonian use at time t. For initial strategies x(0) = 
(cos 2 #0 sin 2 # ) T and y(0) = (cos 2 </> sin 2 (/> ) T . Un- 
der classical replicator dynamics, a strategy evolve with time 
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as below. 



x(t) = x (0) 



jS((j)) sin 2 



' cos 



1 

-1 



dt' (6) 



where 6(0) = (a - c) cos 2 + (b - d) sin 2 and <S(0) = 
(a — c) cos 2 + (6 — d) sin 2 0. The above expressions are 
deduced by substituting ^ = -5(</>) sin 20 /A and ^ = 
—5(0) sin 20/4 into the replicator dynamics expression. 

On the other hand, players may employ quantum strate- 
gies following quantum replicator dynamics. Let = 
(e ia cos e~ ia sin 0) T and \() = (e ia cos (j) e~ ia sin (j)) T 
be the quantum strategies of Assyrian and Babylonian respec- 
tively. Their strategy evolutions are governed by the Hamilto- 
nian generators Ha and Hb, where 

Ha(H)) = 5>W>(« A (|a>IC» " u A (\b)\C)))\a){b\ 



Therefore, 

W)) 



■■fo iH ^ dt '\^{t)) 



|V(0)> 



-e smf 
e ia cos 



dt' (7) 



Comparing equations [6] and |7j we see that quantum replica- 
tor dynamical evolution is different from its classical counter- 
part. One significant difference is that, the classical strategy 
will converge whenever or </> — >> (which leads to a classi- 
cal pure strategy), since sin 2 = leads to a static evolution; 
but the quantum strategy keeps evolving in the complex space 
even if or <j> — >> 0. 

Particularly, in Prisoner-Dilemma game, a pair of classi- 
cal evolving strategies always converge to pure strategy De- 
fect, but a pair of evolving quantum strategies keep oscillating 
within a cycle and never converge (See figure 4). Such a dif- 
ference is due to the Hilbert space of quantum strategy form a 
spherical manifold instead of an affine 2D-plane with borders. 
When a quantum strategy vector reaches pure strategy Defect, 
it keeps moving towards the negative quadrant and never stops 
at s. This makes the quantum state evolving in a circular loop 
and never converges to any equilibrium. 

Previously we consider both players playing quantum 
evolving strategies. One may be curious if quantum dynam- 
ics provides us some advantages in a unfair game against its 
classical counterpart. In iterated Prisoner-Dilemma based on 
replicator dynamics, quantum does not provide us any bene- 
fit. On the other hand, in Trading-Farming game, regardless 
of its less advantageous initial position, employing a quan- 
tum replicator dynamics ensures quantum player (Assyrian) 
to achieve a better payoffs compared to classical replicator 
dynamics (Refer to figure [3] and [4]). 
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FIG. 4. Classical vs Quantum replicator dynamical evolution 
in Prisoner-Dilemma (left) and Trading-Farming (right). Assyrian 
employs quantum evolving strategy, Babylonian employs classical 
evolving strategy. 
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